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ABSTRACT 

During the past few years considerable insight has been gained into the QUEST 
algorithm both as a maximum-likelihood estimator and as a Kalman filter/smoother 
for systems devoid of dynamical noise. This conference contribution describes the 
new algorithms and software and makes analytic comparisons with the more conven- 
tional attitude Kalman filter. We also describe how they may be accommodated to 
noisy dynamical systems. 


Introduction: the QUEST Algorithm 

The QUEST algorithm is based on a least-square problem first proposed in 1965 by Grace 
Wahba, then a graduate student in Statistics at George Washington University and working 
during that summer for IBM in Gaithersberg, Maryland. The problem, which appeared in 
SIAM Review [ 1 ], was, in fact, Wahba’s first publication. In it she posed the problem of finding 
the attitude which minimizes the loss function 


i ( j4 ) = |S a i |W,_ AVJ 2 , 


( 1 ) 


i=i 


where W i( i = 1, . . . , n, are a set of unit-vector observations in the spacecraft-fixed reference 
frame, and V,, i = 1, . . . , n, are the representations of the same unit vectors with respect 
to the primary reference frame (the frame to which the attitude is referred). The a { are a set 
of non-negative weights. Provided that at least two of the observation vectors are not parallel 
(or anti-parallel) and the corresponding weights are positive, a unique minimizing attitude 
matrix will always exist. Dozens of solutions have been proposed to find this attitude matrix, 
of which the fastest currently and most frequently used is the QUEST algorithm [ 2 ], based on 
the ^-algorithm of Davenport [ 3 ]. 
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Tb solve for the optimal attitude we first write equation (1) in the form 

£04) = E«,-2>W,--4V,, (2) 

» = 1 1=1 

n 

= 53 a i ~ 9 W • (3) 

«=1 

The gain function, g(A), may be further manipulated to give 

g{A) = tr [B t A) , (4) 

where B, the attitude profile matrix, is given by 

*=X> ( 5 ) 

i=l 

The minimization of L{A) is equivalent to the maximization of g(A). 

We now note that g{A) is linear in A. Nonetheless, the minimization of g(A) is not simple 
because the 3x3 matrix A is subject to six nonlinear constraints. Thus, the minimization 
of g(A) over A is not necessarily simple. 1 The attitude matrix, however, can be written as a 
quadratic function of the quaternion, 

? = [?i> ?3,9 4 ] T = [^j . (6) 

namely, 

(7) 

( 8 ) 

S = B + B t , s = trB, [[Z ]] = B-B T , (9) 

the gain function may be rewritten in terms of the quaternion as 

g(q) = g(A(q)) = q T K q , (10) 

where 

S — sI 3x3 Z 

K = 

Z T s 

’Not all students of the Wahba problem will agree, as shown by Markley [4]. 



M.i) — (<?4 ~ 9 ■ 9)73x3 + 2qq T -f 2 q 4 [[ q ]] , 

where 

r o ? 3 -?2 

[[q]]= -93 0 9i ■ 

. — 9l 0 

Defining further the quantities 
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The maximization of this gain function, subject to the constraint that the quaternion have 
unit norm, leads to an eigenvalue equation for g*, the optimal quaternion, which is [2, 3 ] 


Kq' 




( 12 ) 


where A max is the largest eigenvalue of K. Thus, the optimal quaternion may be found by 
solving this 4x4 eigenvalue problem and choosing the eigenvector with the largest eigenvalue. 
This is Davenport’s g-method, which was applied in this form to the HEAO mission [5]. 

The QUEST algorithm, a very fast implementation of Davenport’s g-method which avoids 
the complete solution of the eigenvalue problem, is formulated in terms of the Gibbs vector, 
Y, 

Y = q/g 4 . (13) 

In terms of the Gibbs vector the optimal attitude may be written as 

Y* = [(A max + s) I 3x 3 — S'] 

and the optimal quaternion then reconstructed as 


Z, 


(14) 


<7* = 


v"i + [Y*| 2 L 1 J 


(15) 


Key to the QUEST algorithm is the fact that a very good first approximation of the optimal 
attitude (accurate to 0(a 4 ), where a is the standard deviation of a typical sensor error) may 
be obtained by substituting A^ x for A max , with 


A mix = Y, a i ■ 


(16) 


i=l 


It is easy to show that 

^, = ^«(l + 0(^)). (17) 

The further refinement of A max is described in detail in [2]. This amounts to solving the equa- 
tion 

A m „ = s + Z T z ' 0 s ) 

by the Newton-Raphson method using A^ x as a starting value. 

If the measurements are assumed to be corrupted solely by Gaussian random errors of the 
form, 

W, = T V, + AW, , (19) 

where the sensor error AW f satisfies 

£{AW,} = 0, 

E{ AW, AWf } = a] [/ 3x3 - (A V { ) (A V,) T ] , 
and the weights a it i = 1, . . . , n, are chosen so that 


( 20 ) 

( 21 ) 

( 22 ) 
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for some constant c, then Reference [2] shows that the attitude covariance matrix is given by 


Pee — 


E^2 (4x3-(W,-) lrue (W ,.)£,«) 

t = l 1 


-1 


(23) 


where 

(W,) true = Aru«V,-. (24) 

In actual computation we generally substitute W, for (W,) true , since the latter value is not 
known, in general. The attitude covariance matrix is defined here as 


P ee = Cov(A0), (25) 

where A0, the attitude error, is given by 

^Le*/ 3 X3 + [[A0]], (26) 

and Cov denotes the covariance. Thus, the QUEST algorithm gives a fast direct method for 
constructing the optimal attitude. The algorithm has other valuable properties as well, which 
are discussed in [ 2 ]. 


The Attitude Kalman Filter for the QUEST Model 

QUEST is a batch estimator taking as input a collection of simultaneously measured unit 
vectors. When the data is not simultaneous and we wish to use data at widely different times, 
the algorithm of choice has been the Kalman filter. In the present section we present the 
Kalman filter for the measurement model of equations (19)— (21). 

Since the QUEST algorithm does not treat dynamical noise (the measurements being all 
simultaneous, this would hardly be relevant), we examine the Kalman filter for a system with- 
out dynamical noise, that is, a system for which the temporal development of the attitude is 
described by 

^ = (27) 

where the transition matrices, k = 0, . . . , N - 1, is known perfectly. In general, the 
subscript k will indicate the time, and the subscript i will indicate the sensor. For such a system 
the prediction of the attitude matrix must have the form 

= — 1 |Ar — 1 > (28) 

where is the estimate of the attitude matrix at time t k _ l based on all the measure- 
ments up to that time inclusively, and is the estimate of the attitude matrix at time t k 

based on the same data. Since dynamical noise is absent, the prediction of the attitude covari- 
ance matrix is given by 

= • (29) 

and, since no confusion can result, we have dropped the subscript 60 to make the notation less 
cumbersome. 
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For updates the calculation is more involved. Since the attitude has only three free pa- 
rameters while the attitude matrix has nine, we do not update the attitude matrix directly but 
compute instead the updated value of £ k , the incremental rotation vector, which is defined by 


A* = e 1[e ‘ 1] 

(30) 


(31) 

so that by definition 

Ck\k - 1 = 0. 

(32) 

Then we can write the linearized measurement as 


C*sW*-W fclfc _ 1? 

(33) 

where 

(34) 

Combining equations (31)-(34) yields 


Cjt = H; k £ k + v k , 

(35) 

where 

(36) 


The measurement noise of our linearized measurement, v fc , is assumed to be Gaussian and 
zero-mean. Its covariance matrix can have only rank 2 since unit-vector measurements have 
only two degrees of freedom. However, it can be shown that the true covariance matrix of v k 
can be replaced by 

Rk = a kl 3x3 > (37) 

which is obviously of rank 3. This substitution leads to the same estimates and covariance 
matrices as the form given by equation (21) [6]. The reason for this is that the additional 
noise which makes the covariance matrix of rank 3 is along the direction of W*., to which the 
attitude is not sensitive. It can be seen from equation (36) that H k annihilates that component 
from W fc . 

The Kalman filter update equations now become 



— Hk Pk\k-\P k + P-k 1 

(38) 

K k 

= Pk\k-\Hl ^k 1 

(39) 

Vk \k 

= K k W k , 

(40) 

p 1 fclfc 

= (-^3x3 “ KkHk)Pk\k-l 

(41) 


= (7 3x3 - KkHk)Pk\k-i(h* 3 - K k H k) T + K k R k H k . 

(42) 


The Kalman filter equations, (28) through (42), can treat non-simultaneous data but are con- 
siderably more complicated than the QUEST algorithm for simultaneous data. It is natural 
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ask, therefore, whether the QUEST equations can be manipulated to remove the restriction to 
simultaneous data. The answer is affirmative. In fact, the Wahba problem was applied to non- 
simultaneous quite some time ago [ 5 ] but in a batch framework, not in a sequential framework 
like the Kalman filter. 


The Sequentialization of QUEST: Filter QUEST 

Suppose that we have a set of simultaneous measurements at time which we can denote 

by * = 1, . . . , n, and let us denote the optimal attitude at time t k _ 1 computed using 

the QUEST algorithm by AJ_ 1 | Jk _ 1 . Recalling equation (27), the optimal value of A k based 
on the data at time t k _ t is obtained by minimizing 




»=1 


(43) 


where the additional subscript on L(A ) indicates the time of the data. Since $ k is orthogonal, 
this is clearly the same as finding the value of A k which minimizes 

j ^-1 

= 2 £ «.,*-! - V . (44) 

1 

that is, by replacing by , or equivalently, noting equation (5), by replacing 


n*-l 

B k-i\k-i ■ 2 “M-1 W i|Jk _ 1 (45) 

1*1 

by 

1 

B k\k-l — (46) 

1=1 

Thus, for the filter version of QUEST the prediction step becomes simply [7] 

B k\k-\ *= $*-i j5fc-i|jt-i • (47) 

We may, in fact, drop the distinction between the indices i and k and treat each unit vector has 
having a distinct time t k , reference vector V k and weight a k associated with it. If two vector 
measurements W fc+1 and W* are simultaneous, then t k+1 = t k and = / 3x3 . 

For the update step of Filter QUEST w® note that when we increase the number of mea- 
surements in the measurement set of equation (5) we simply add a term, a k W* Vjf, to B. Thus, 
the update step in terms of the attitude profile matrix is 

B k\k = 5 *|*-i + a k'W k V k . (48) 


The QUEST algorithm requires also that we know the value of separately. This is given 
by 


^(°) \(°) 

''max* ''max *- 


+ a* 


(49) 
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Equations (47)-(49) are clearly much simpler than the corresponding Kalman filter equa- 
tions (28)-(42). . ■ ■ . 

The covariance matrix can be computed sequentially also by first computing the attitude 

information matrix, F = P~ l . The relevant equations are 

F k\k-i = $k-i F k-i\k-i®l-\ 

F k]k = F k\k-i + i (^xa-WfcW?) . 
u k 

These can be computed likewise [7, 8] from 

F = tr (A* B t )Im - A* B t , 

without the need to have a separate recursion relation for F. Equation (52), in fact, is very 
important because it can be solved for B to yield 

B=(la(F)I 3 „-F^A\ ( 53 ) 

Thus, given initial values, A* 0 ^ and P 0 (o , the initial value of the attitude profile matrix, B a (<J can 
be computed from equation (53). This last fact makes the analogy of Filter QUEST with the 
Kalman filter complete. In fact, since it can be shown that QUEST is a maximum-likelihood 
estimator for Gaussian errors [7], the Kalman filter and Filter QUEST will yield identical 
attitude estimates for the attitude system considered above. 

It is well to note that the Filter QUEST is an information filter rather than a covariance 
filter. This is made clear by the fact that B^ is a meaningful quantity even though the attitude 

cannot be calculated from a single measurement. 

While the prediction and update equations of Filter QUEST are simple, it is also true that 
the need to apply the part of QUEST which computes the optimal attitude and covariance ma- 
trix from the attitude profile matrix is an additional computational burden. The real advantage 
of Filter QUEST comes when one does not require an attitude solution at every measurement 
update. In this case, the efficiency of Filter QUEST relative to the Kalman filter is greatly 
enhanced. 

The Treatment of Noisy Processes 

In general, attitude systems are subject to random torques, or the dynamical equations are 
replaced by the gyro equations [9] so that the gyro measurement noise becomes process noise. 
In the Kalman filter formulation, this extra complication results in the prediction equation for 
the attitude being replaced by ' , 

F k\k-\ - $k-i F k -i\k-i®l-i + Qk - 1 • 

where Q k _ x is the covariance of the accumulated process noise from time t k _ 1 to time t k . 
Possibly also, the state vector must be augmented to include the state of the Markov process 
driving the gyro errors [9]. 


(50) 

(51) 

(52) 
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Such an enhancement is not possible in Filter QUESTsince there is no simple way of adding 
process noise to B. One could, of course, compute P^ k _i after every prediction step of Filter 
QUEST; add the process noise covariance to P k | Jfe _ 1 , and then use equation (53) to compute 
a new B k This would be extremely burdensome and destroy whatever computational ad- 
vantage Filter QUEST offered. 

An approximate way to simulate the treatment of process noise is to modify Filter QUEST 
so that it becomes a fading memory filter. Thus, we replace the prediction step in QUEST by 


-#*-i|k-i > (55) 

where a k is a number between zero and one, and which is also a function of k. Clearly, if a 
is chosen to be zero, then Filter QUEST will have no memory at all. If the data consists of a 
sequence of frames each containing several simultaneous vector measurements, then choosing 

= 0 at the end of each frame and a k = 1 otherwise will produce a sequence of single-frame 
QUEST estimates. Choosing a k = 1 for all k corresponds to infinite memory, which would be 
appropriate for a genuinely noiseless system. 

How should one choose a? Clearly, if the accumulated process noise between measure- 
ments is generally much smaller than the measurement noise, then it should be expected that 
Filter QUEST properly adjusted will average several measurements and obtain a much more 
accurate result than the single-frame estimate. If a k is adjusted to be too small, then Filter 
QUEST will take insufficient advantage of the data and the result will be less accurate. Like- 
wise, ifa k is too large, then the Filter will overweight data which has become less accurate due 
to the accumulation of process noise, and the solution will be less accurate again. Thus, there 
is generally an optimal choice for a k . 

Let us consider the case where the process noise is equivalent to 


A, = e ll»*-ill* 




(56) 


where w k is a white sequence with covariance g/ 3x3 . Such a model is characteristic of an ideal- 
ized laser gyro and would be appropriate if the dynamical information were coming from laser- 
gyro measurements. Let us consider also that the spacecraft is equipped with three attitude 
sensors which at each time t k sense simultaneously unit vectors along each of the coordinate 
axes, each with an accuracy of a. In such a case, clearly, we would choose a k — 1 between 
the unit-vector measurements in each frame and a k = a after the last measurement in each 
frame. In this case, an analytical solution is possible for the covariance matrix of the QUEST 
filter, which in the limit that an infinite number of measurements have been processed turns 
out to be 

tt* /> n 

(57) 


pQF _ QF , 
i k|k ~ Pk\k I 3x3 » 


with 


and 


p QF = — 

Pk\k 2 


1 — a 

[l + a 


+ 


2 a 2 1 

x 1 - a 2 J ’ 


This function is a minimum for 


a opt = 


x = a 2 /q . 


X + 1 — y/l -f- 2X 


(58) 


(59) 


(60) 


132 



New QUEST® for Better Attitude® 



Figure 1, Filter QUEST Covariance as a 
Function of the Fading Memory Factor for o 2 « q 


If we choose values such as 

o = \ deg , q = (.5 deg) 2 , ( 61 ) 

then a plot of will look like Figure 1, which shows a broad minimum at a = 0.5 and a 
minimum variance of p% = (.5 deg) 2 . This should be compared with the single-frame result 
which is Q 


pSingle-frame _ _ _ (.707 deg) 2 
2 


(62) 


The Filter QUEST solution is not a very large improvement over the single frame solution but 
not inconsistent with the relatively large gyro noise we have chosen compared to the vector- 

sensor noise. „ , r 

The general formula for the minimum Filter QUEST variance for this example as a function 

of o and q is , 

„QF _ a l . (63) 


«.«(+) = 


Thus, 

as expected, and 


T3 QF 
'min k\k 


_ ~ jingle-frame as x 
2 1 


o, 


» QF 
P min k\k 


o 2 [2 

2 Vi 


as x 


00 , 


(64) 

(65) 
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alpha 


Figure 2. Filter QUEST Covariance as a 
Function of the Fading Memory Factor for <r 2 »q 

Equation (65) shows that for x very large, the effective number of previous measurements 
which Filter QUEST is averaging to reduce the error is 

N efi = x/2- ( 66 ) 

Note that the dependence of a opt on the measurements is in the limiting cases 

“opt ~ + ® as x * 0 , (67) 

and 

“opt -» 1 - as X -> °o . (68) 

Th u s> o 0p j will generally be extremely close to unity for cases where we would generally want 
to use a filter. For example, if we choose instead of the previous case more physical values 
such as tr = 1 deg, q = (l arc min) 2 , then a opt = .976 and p^ k = (.n deg) 2 , The standard 
deviation is, thus, almost seven times smaller than the single-frame value. The dependence of 
the variance on the fading-memory parameter for this case is shown in Figure 2. 
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Smoother QUEST 

The Kalman filter has the disadvantage that only anterior data is used in the estimate. Thus, 
posterior data, which is equally accurate, is not considered, thereby increasing the covariance 
by at least a factor of 2 over its achievable value. Also, for early estimates, less data is used 
leading to a less accurate result than for later estimates. 

A Kalman filter/smoother uses both the data which precedes the time of the estimate and 
the data which follows the time of the estimate. While such an estimator is more accurate, it 
has the disadvantage that it cannot function in real-time. Thus, a smoother is generally more 
applicable to background processing on the ground rather than real-time processing on the 
spacecraft. 

The QUEST algorithm also admits a smoother implementation. Suppose we are given mea- 
surements W k , k = 1 ,... , jV. Then the smoothed attitude profile matrix B k at time t k , 
k = 0, . . . , W, is given by 


Bk\. N — ak ^ 11 ^ B o]o 


1 = 1 


+ a k W k V T k 



(69) 


i=k + 1 


The first term in this equation is the contribution of the a priori estimate of the attitude. If 
the smoother were implemented in segments, B 0 1 0 would be the attitude profile matrix for the 
final estimate of the previous segment. The second term gives the predicted contributions of 
the measurements preceding the current measurement. The third term is the current mea- 
surement. The first three terms thus constitute the usual Filter QUEST expression for the 
attitude profile matrix. The fourth term gives the contribution from the measurements which 
come after the time of the estimate. The factors of the transition matrices transform the mea- 
surements to the body frame at time t k and the factors of downgrade the data to reflect 
the ravages of process noise. Equation (70) may be rewritten as 

B k\N - Bk\k + Dk » k = 0,...,N, (70) 

where B k j fc is the “filtered” attitude profile matrix, which satisfies the previous Filter QUEST 
(forward) recursion relations, and is given by the first three lines of equation (69). D k is the 
contribution of the posterior measurements, which is given by the last line of equation (68). 
By inspection, we see that D k satisfies a backward recursion relation, 

D n = 0, (71) 

= + , (72) 

in complete analogy to the usual Rauch-Tung-Striebel Kalman filter/smoother [10]. The in- 
formation matrix again is given by equation (52) but with B k \ N replacing B k \ k . Since $ k in 
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the present application is orthogonal, the inverse is given by the transpose. Thus, the set of 
smoothed attitude estimates for an interval of data is obtained with only twice the computa- 
tional burden of the calculation of the filtered estimates. An obvious drawback, however, is 
that all of the data, filtered attitude profiles matrices, and attitude transition matrices must be 
stored. Thus, it is beneficial to process overlapping segments (but whose data length is much 
greater than N eff ) in order to keep storage requirements for the processing within reason. 


Discussion 

Despite its simplicity and obvious power in the above example. Filter QUEST has its draw- 
backs. First, it only estimates attitude. Thus, in a system in which angular velocity or gyro 
biases must also be estimated, Filter QUEST will not be sufficient. Also, for systems with poor 
geometries, say only a single measurement, Filter QUEST’S approximation of a single fading- 
memory factor may be inadequate. Also, Filter QUEST suffers from the short-comings of 
QUEST; which, if viewed as a maximum-likelihood estimator, effectively assumes the mea- 
surement error model given by equations (19) — (21). This is not always the case. However, it is 
frequently so, and for the most part this model is reasonable, and Filter QUEST offers a useful 
if limited alternative to the full Kalman filter. In one recent example Filter QUEST has been 
applied to the COBE mission with encouraging, if not spectacular, results [ 11 ]. 
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